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ABSTRACT 


.)  This  paper  considers  the  simplest  model  of  change-point  in  which  at  most 
one  change  in  the  mean  may  occur.  Results  include: 

1)  Introduction  of  a  test  for  the  null  hypothesis  that  *no  change  in 
the  mean  occurs  ,  and  the  limit  distribution  of  the  test- stati Stic. 

2)  Approximate  calculation  of  the  power  of  the  test, 
i)  Interval  estimation  of  the  position  of  change. 

4)  Point  estimation  of  the  jump  at  the  point  of  change  and  its 
asymptotic  distribution. 

5) 1  Evaluation  of  the  bias  of  the  MLE  of  error  variance. 
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TESTING  AND  INTERVAL  ESTIMATION  IN  A  CHANGE- 
POINT  MODEL  ALLOWING  AT  MOST  ONE  CHANGE 

Xiru  Chen 

1.  INTRODUCTION 

Consider  the  model 

X(t)  =  f(t)  +  e(t),  0  <  t  <  1  (1) 

where  f(t)  is  a  non-random  function  of  the  form 

a ,  0  1  t  1  tg 

f(t)  =■ 

a+e,  tQ<t<l 

» 

with  a,  e,  tQ  unknown  and  0  <  tQ  <  1.  tg  is  called  the  "change-point" 

(of  the  function  f,  or  of  model  (1)),  and  0  the  jump  at  the  change- 

point.  e(t)  is  a  random  variable  whose  distribution  function  F  is 

independent  upon  t.  We  assume  that  E(e(t))  =  0  for  t  £  [0,  1],  and 

2 

e(t)  possesses  a  finite  variance  a  „ 

We  desire  to  make  inference  on  tg  and  e,  using  observations  made 

on  {X(t)}„  In  this  paper  we  assume  that  these  observations  are  taken  on 
equal-paced  t  values.  Specifically,  we  observe  X(i/n),  i  =  l,...,n. 

Note  that  X( i/n)  depends  upon  both  i  and  n,  but  for  simplicity  of  notation 
in  the  sequel  we  write  X^  instead  of  X(i/n).  We  assume  that  Xj,..,,Xn 

2 

are  independent.  These  assumptions  and  notations,  such  as  a,  e,  0  ,  F 
are  valid  throughout  this  paper,  and  will  not  be  mentioned  latter. 

Model  (1),  the  so-called  AMOC  (Allowing  at  Most  One  Change),  is  the 


simplest  model  for  change-point.  As  an  important  and  extensively  studied 
model,  there  exists  a  huge  literature  on  it,  and  we  refer  to  Krishnaiah 
and  Miao  (1986),  CsErgo  and  Horvath  (1986)  for  a  detailed  survey  of  this 
subject » 

The  methodology  of  the  present  paper  is  nonparametric  in  nature, 
in  that  we  do  not  resort  to  likelihood  and  thus,  the  normality  condition 
can  be  dispensed  with.  The  work  borrows  an  idea  advanced  by  Yin  (1986). 
This  idea  proposes  to  search  the  possible  change-point  by  comparisons 
made  locally*  The  method  has  the  merit  that  the  resulted  statistic 
in  testing  the  null  hypothesis 

Hq  :  8=0  (i.e.  no  change-point  exists)  (3) 

has  a  simple  asymptotic  distribution.  This  not  only  facilitates  the 
testing  of  HQ,  but  also  is  convenient  in  estimating  the  power  of  test, 
and  in  constructing  a  confidence  interval  for  the  change-point  tQ. 

These  problems  will  be  studied  in  Section  2,  Section  3  and  Section  5, 

with  different  assumptions  on  F*  Section  6  is  devoted  to  a  brief 

discussion  about  the  statistical  inference  on  the  jump  e.  Section  4  is 

2 

related  to  the  estimation  of  variance  a  . 
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20  F  IS  NORMAL  WITH  A  KNOWN  VARIANCE 


In  this  section  we  consider  the  case  that 


F-j  N(0,a  ) 


(4) 


2  . 


and  a  is  known.  The  method  is  based  upon  the  following  theorem. 


Define 


THEOREM  1.  Suppose  that  Xj,...,X  are  independently  and  identically 
distributed  (iid)  with  a  common  distribution  N(a,o2).  Let  £  =  f  be 
a  positive  integer  such  that 


1  im 

n-*» 


1 


l/n  =  °*  15  (logn)2/f  =  0 


m-f 


(5) 


m 

_  l  X.  -  l  X.  ,  m  =  2 C,  2(  +1 , 

m  m-e+1  1  m-26+1  1 


C_  =  maxf | Y  |  :  m  =  2f,2L+l,... ,n} 


m 

Ap(x)  =  [21og(3n/2C-3)]'1/2(x+21og(3n/2(-3)  + 


(6) 


(7) 


oglog( 3n/2( -3)  - 


iAn<*»  ’  exp(-2e-x). 


—  OO  X  < 


Hog-)  (8) 
(9) 


Proof „  Construct  a  Standard  Brownian  Motion  fW(t),  t  >  0}  such 


W(||)  =^2?(X1+00.+Xm-ma)/a,  m  =  1,2,... 


and  put 


Z(t)  =  3'1/2[W(t)-2W(t+3/2)  +  W(t+3)] ,  t  >  0 


Then  it  is  seen  =  oZ(3m/2f-3) ,  m  =  2f,  2f+l,...  .  Put 


(10) 


Cn  =  sup{  I Z(t )  I  :  0  <_  t  £  3n/2£-3},  nn  =  ?n  ' 

We  show  that 

lim  np  /I ogn  =  0,  a,s.  (11) 

In  fact,  we  have 

0  <  n  <  -i  sup{|W(t+s)  -  W(t)|  :  0  <  s  <  3/2-£,  0  <  t  <  (3n-3)/2£-31 

”  n 

Insert  T  =  3n/2 1  -  3,  h  =  3/2 £,  e  =  1  and  u  =  (/?/4)/£/‘logn  6 

-in  lemma  1.2,1  of  Csorgo  and  Revesz  (1981),  we  get 

P( r^/logn  >_  6)  <  C  n  exp(-62£/241ogn)  (12) 

Here  6  >  0  is  given  and  C  is  a  constant  not  depending  on  n.  Since 
£/(logn)2  -*  (li)  follows  from  (10)  and  Borel-Cantelli  Lemma. 

Z(t)  is  a  stationary  Gaussian  process,  with  EZ(t)  -  0,  and 


P(t)  =  Cov(Z(t) ,  Z(t+x) )  - 


f  i  -  M » 

!-l  + 1  t  ! /3 , 


0  <  I T i  <3/2 
3/2  <  |t|  <  3 
3  <_  |  T  |  <  ® 


Thus,  the  conditions  of  a  theorem  of  Qualls  and  Watanabe  (1972)  are  met 
and  we  have 


^  P(fn5  An(*))  =  exp(-2e"x)  (13) 

For  n  large  we  have 

An(x+Ax)  -  An(x)  >  Ax//Zlogn,  ax  >  0, 
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Hence 

p(-n  I  An(x- 1  ax j ) )  -  P(nn  >  |  Ax|//2'logn) 

1  p(  Va  -  Vx^ 

1  p(cn  <.  An(X+|Ax()  +  P(nn  >  |  AxJ/VZlogn)  (14) 


For  n  large0  From  (12)  -  (14),  letting  n ->■  =*and  then  ax  -*•  0,  we 
get  (9).  Theorem  1  is  proved. 

Theorem  1  suggests  a  way  to  test  the  hypothesis  (3):  For  the  chosen 
size  a,  0  <  a  <  1,  solve  the  equation  exp(-2e~x)  =  1  -  a,  the  solution  is 

x(a)  =  -log(-ilog(l-ct)) 


Calculate  d  =  2£/n,  Cn(a,d)  =  An(x(a)),  and  reject  the  null  hypothesis 
(3)  when  and  only  when 


Sn  >  a  Cn(a,d) 


(15) 


Under  condition  (3),  this  test  has  an  asymptotic  size'  a  as  the  sample 
size  n  tends  to  infinity. 

We  give  an  estimate  of  the  power  e(e)  =  3n(e,a)  of  this  test. 

For  this  purpose,  let  r  be  the  integer  satisfying 


r/ n  <  tQ  <  (r+1 )/n 


(16) 


Then  Yr+^  -  N ( /tfl  e,o^).  Therefore, 


B(0)  ^  p( I Yr+£|  >  a  Cp(a,d)) 

=  t(  M^C  (a,d))  +«(-M^.  C  (a,d)) 

/la  n  /la  n 

>  *(M^  .  C  (a,d)) 

/la  " 


(17) 


V.V 


-v'.v.v, 


.V.VAW.VAlV 
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where  <f>  is  the  distribution  function  of  N(0,1)°  From  this  inequality 
it  is  seen  as  is  obvious  intuitively,  that  for  gaining  a  larger  6(e), 
one  should  give  t  a  larger  value. (17)  Also  suggests,  as  is  again 
obvious  intuitively,  that  6(e)  increases  steadily  with  le|/a,  the 
magnitude  of  the  jump  measured  against  the  standard  deviation  o. 
Practically,  the  presence  of  a  change-point  is  of  importance  only  when 
the  ratio  |e|/'a  is  substantially  different  from  zero. 

Now  consider  the  interval  estimation  of  the  change-point  tg.  The 
existence  of  tg  may  be  a  fact  known  in  advance,  but  usually  it  is 

evidenced  by  the  rejection  of  the  null  hypothesis  (3).  Regardness  which 
is  the  case,  we  adopt  the  following 

Rule.  Find  integer  k  such  that  |Yk)  =  cp.  Take  [(k-2£)/n, 

k/n]  as  the  confidence  interval  of  tQ. 

The  length  of  this  interval  is  d  =  2l/n,  Therefore,  to  enhance 

the  accuracy  of  this  estimate,  one  should  choose  a  small  l.  But  there 
are  other  things  to  be  considered.  First,  if  the  existence  of  tg  is  to  be 
decided  by  the  test  defined  above,  a  small  i  increases  the  risk  of 
false  acception  of  the  hypothesis  (3),  when  in  fact  it  is  untrue, 

Second,  a  small  l  corresponds  a  low  confidence  coefficient  of  the 
interval .  To  be  more  specific,  we  give  an  estimate  of  the  confidence 
coefficient  y  of  this  interval,  as  follows: 


Since  X  ,  X  X  ,  are  iid  when  m  6 (r,  r+2£),  by  Theorem  1  we  have 

m  m-1  m-2t-l 


where  the  probability  P(?n  <_  a  Cn(a,d))  is  computed  under  the  condition 
that  X^,.OJ,Xn  are  iid.  Hence,  neglecting  the  error  in  the  latter 
equality  of  (18),  we  have 

y  >_  ( 1-a)  +  P(|Yr+^|  >_  a  Cn(ct,d))  -  1 

>  $(M^  -C(a,d))-a  (19) 

“  ft  a  n 

and  this  inequality  suggests  that  y  should  increase  with  |e|/a. 

Using  (19),  we  can  give  the  following  important  question  an  approxi 
mate  solution:  Form  a  confidence  interval  of  tg  with  prescribed 

length  dQ  and  confidence  coefficient  1  -  aQ.  The  question  is  how 

to  choose  l  and  n.  To  do  this,  give  a  in  (19)  the  value  aQ/2,  and 

solve  the  equation 

<f>(je!  SZ/[J2  a)  -  Cn(a,d))  =  1  -  ciq/2 

to  obtain 

t  •,  2(M)-2(Cn(^,d0)  ♦  Uv/,  n  2€/d0  .  (20) 


Here  U&  ^i s  the  upper  -  point  of  N ( 0 , 1 ) 


The  solution  (20)  has  the  trouble  that  it  involves  the  parameters 
e  and  6,  the  former  is  surely  unknown  and  the  latter  is  usually  unknown 
In  some  cases  it  may  be  feasible  to  take  some  preliminary  samples  to 
give  a  crude  estimation  of  them,  but  we  recommend  the  foliowing  proce¬ 
dure:  Decide  by  practical  consideration  a  constant  M  such  that  only 
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1 

'A 


■A 


>? 

a 

ij 

H 

n 

*ri!< 


when  | 0 | /a  >_  M,  the  change-point  tg  is  of  any  real  importance. 

Replace  |e|/a  in  (20)  by  this  M. 

For  example,  take  aQ  =  0,05,  dQ  =  0.1,  M  =  2,  (20)  gives 

l  =  19,46,  n  =  389.2 

The  result  seems  rather  good  as  it  is  comparable  with  the  sample  size 
which  is  needed  in  estimating  the  probability  p  of  the  binomial 
B(n,p).  The  sample  size  needed  to  guarrantee  a  confidence  interval  of 
p  not  longer  than  0,1  and  with  a  confidence  coefficient  not  smaller 
than  0.95  is  roughly  384.  So  when  |e|/a  is  not  smaller  than  2,  which 
seems  a  moderate  requirement  in  practice,  we  are  in  a  situation  com¬ 
parable  to  one  seeking  an  estimation  of  the  binomial  p. 


9 


3.  F  IS  NORMAL  WITH  VARIANCE  UNKNOWN 
2 

When  the  variance  a  is  unknown,  we  use  the  sample  Xn 

“2  ?  A 

to  estimate  it_  Denote  by  a  the  estimator  of  a  ,  we  use  a  to  replace 
a  in  (15)  to  perform  the  test.  In  order  that  the  resulted  test  still 
has  an  asymptotic  size  a,  the  estimator  a  must  satisfy  certain  condi¬ 
tion  stated  in  the  following  theorem. 

THEOREM  2.  Suppose  that  the  conditions  of  Theorem  1  are  met,  and 

A2  2 
o  is  an  estimator  of  a  satisfying 

*9  o  P 

liml  a  -  cr  jlogn  =  0  (21) 

n-x»'  I 

£  means  convergence  in  probability.  Then 

P U/o  <  fln(x»  =  exp(-2e"x)  (22) 


Proof  follows  easily  from  Theorem  1.  For,  from  (21)  we  have 
*  P 

| a  -  a|logn— ^0.  Thus,  for  arbitrarily  given  e  >  0  we  have 
PUn/cj  <_  An(x)  (1-e/logn) )  -  P(|  a  -  ojlogn  >_  e) 

<  P(£n/o  <  An(x) ) 

* 

<_  P(cn/o  <_  An(x) (1+e/logn))  +  P(|a  -  a|logn  >_  e ) 

Given  6  >  0.  For  n  large  we  have 

An(x-< S)  <  An(x)(l-s/logn)  <_  An(x)(l+e/logn)  <  An(x+6) 
Therefore,  for  n  large  we  have 


J 


Via 

VjJg 

I 

>1 

v 


I 


M 
$ 


m 

;? 

3 

r*A' 

1 

•I*! 


a 

I 


P(Cn/a  <  Ap(x-6;)  -  P(  i a  -  ajlogn  >_  e) 


<  P(?  /a  <  A  (x)) 
-  v^n  -  nv  " 


I  P^-n/a  1  An(x+5))  +  pUa  "  aM°9n  1  0 

Letting  n  -*■  ®  and  then  6  -*•  0,  we  obtain  (21) 

2 

Estimation  of  the  variance  a  „ 

Under  the  normal  assumption  here,  the  maximum  likelihood  estimator 
has  the  form  (26)  given  below,  The  following  theorem  shows  that  this 
estimator  satisfies  (21). 

THEOREM  3.  Suppose  that  {N^Ng,...}  is  a  sequence  of  positive 

integers  tending  to  infinity.  For  each  n,  there  are  given  Nn  indepen¬ 
dent  variables  X  .  i  =  1,...,N  such  that 
ni  »  n, 

Xni  -  F(x) ,  i  =  l,...,mn,  Xni  „  F(x-e),  i  =  mn+l,...,Nn 

where  F  is  a  distribution  function  whose  moment  of  (2+<5)-th  or^der 

exists,  e  is  a  constant.  Define 

N  N„ 


n 

Xnlc  *  pni/C' 

yn2c  =  l*  ni^Vc)* 

I  =  l  X  ./N 
n  j-  m  n 

(23) 

1 

9  C 

4  *  p„rx 

Nn 

nlc)2  + 

c  =  1 ,. . . ,Nn 

(24) 

1 

S2(n) 

=  min  |S2c  :  C  =  1,...  ,N^j 

» 

(25) 

a2  =  a2  =  S2(n)/Nn 

(26) 
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m 

Kl 

,**v 

wj 

•i 


$ 

$ 

V,»T 


i 

i 


m 


«w 


Then  we  have, 

1  im, *2  2,-,  p  n 

o_  -  o  logN  r  0 
it**1  n  1  3  n  = 

where  ^  is  the  variance  of  F. 


Proof.  For  simplicity  of  writing  we  shall  in  the  sequel  omit  the 

—  —  2  2 

symbol  n  in  all  notations.  Thus,  Xni ,  m  ,  Nn>  Xnlc,  Xn,  Snc,  S  (n) 

a2  —  —  2  2  A2 

and  aj;  will  be  abbreviated  to  X^ ,  m,  N,  Xjc,  X,  Sc,  S  and  a 

respectively.  Put  Y.  =  X^  -  EX^. ,  T..  =  Y^  +  ...  +  Y^ .  Given  e  >  0, 

from  Kolmogorov  inequality  we  have 


P ( | T  /C|  i  e/logN,  Ne/logN  <  c  <  N)  >  1  -  D(logN)7N 


p(  i  (tn“Tc)/(N“c)  I  1  e/logN,  1  <  c  <  N(l-e/!ogH)ll  -  D(logN)4/N  (29) 

Here  and  in  the  following  we  shall  use  D  to  denote  a  constant  not 
depending  on  n,  which  may  assume  different  values  in  each  of  its 
appearance.  From  (28),  (29),  we  have 

P(|Tc/c|  <.  e/logN,  j(TN-Tc)/N-c)|  i  e/logN  : 


Ne/logN  <  c  <  N(l-e/logN) )  >  1  -  D(logN)^/N. 


First  consider  the  case  0=0.  Since 

c2  c2  _  C(N-c)  /—  v  >2  i. 

Sc  "  SN - (Xlc  X2c}  U 

and  Xlc  -  X2c  =  Tc/C  -  (TN-Tc)/(N-c)  when  e  =  0,  We  have  by  (30' 

and  (31)  that 
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P(S2-S2  >  -  Ne2/(logN)2,  Ne/logN  <  C  <  N(l-e/logN)) 
C  N  —  — 


>  1  -  D(logN)  /N 


For  C  <  Ne/logN  we  have 


S?  -  s2n  i  -  l  ,  '  <V*2c> 


Ne/logN 


v 


Ne/logN  0  „  ,  1  N  ? 

1-2(1  ■  X1  +  TogN^1"  ToglT^ 


For  C  N(l-e/logN)  one  obtains  similarity 

N  ,  N 


$2  _ 

N 


-il 


x;  + 


N(l-e/logN)  1  ^ 


lo'gN 


r‘iJ9 


From  (32)  -  (34),  and  putting 
2  «  ,  N 


Ne/logN 


We  have 


P(|a2-S2/N|logN  >  Q)  <  D(logN)4/N 
Denoting  by  a  the  expectation  of  variable  Xj,  we  have 

1  im  P(Q  <  ~-(o2+a2+ 1))  -  0 
n-*»>  —  1-e 


(32) 


(33) 


(34) 


q  =  e  +  2c.  1  £  x2  +  2e  +  \  X2)  (35) 

W  TogN  HN{  1  Ne  x  1  N(l-e/logN)  1 


(36) 


(37) 


Further,  it  can  easily  be  shown  that  under  the  conditions  of  the  present 
theorem,  there  exists  S'  >  0  such  that 


1im  NS' (o2-S2/N)  P  0 
p-*®  .  N  * 


(38) 


From  (36)  -  (38),  (27)  follows 


Then  one  can  easily  verify  that 

-(c-m)CX2c-02,  c  >  m 

-(m-c)(Xlc-c)2,  c  <  m 

Given  e>0  small  enough.  Write 

J  =  {c  :  Ne/logN  ^  c  N(l-e/logN) 

and  separate  the  following  four  cases: 

1.  C  €  J. 

As  in  the  case  of  0  =  0,  we  have  (33)  and  (34) 

2.  C  6  J,  N1/3  <  | c-m]  <  N7/12 

By  Kolmogorov  inequality,  we  have 

P(  y2c-(a+0)|  <  e/logN,  C  6  J)  >  1  -  D(logN)4/N  (40) 

P(|S-(a+e)|  <_  c/logN,  N173  <  c  -  m  <  N7712)  >_  1  -  D(logN)2N~1712  (41) 
P( | c-a|  <  e/logN,  N1/3  <  m  -  c  <  N7/12)  >  1  -  0(logN)2N' “1/12  (42) 

From  (40),  (41),  and  the  first  part  of  (39),  we  get 

P(Sc~Sm  -  '  4e2N7/12/(logN)2,  c  G  J,  N1/3  <  c  -  m  <  N7/12) 

>  1  -  D(logN)2N"1/12 


(43) 


The  case  of  c  <  m  can  be  handled  by  (40)  (42),  and  the  second  part  of 

(39^  resulting  in  an  inequality  similar  to  (43).  Combining  the  two,  we 
obtain 

P(S2-S2  -  "  4e2N7/12/(logN)2,  c  C  J,  N1/3  <  |c-m|  <  N7/12) 

>  1  -  D(logN)2N‘1/12  (44) 

3o  C  €  J,  | c-m|  >  N7/12 
Kolmogorov  inequality  gives 

P(|c-(a+e)|  i  e/logN,  c  -  m  >  N7/12)  >  1  -  D(logN)2N'1/6  (45) 

P(|?-a|  <_  e/logN,  m  -  c  >  N7/12)  >  1  -  D(logN)V1/6  (46) 

From  (40),  (45)  and  the  first  part  of  (39),  we  get  an  inequality 
similar  to  (43),  Likewisely,  from  (40),  (46)  and  the  second  part  of 
(39),  we  get  another  inequality.  Combine  those  two,  we  get 

P(S2-S2  >  -  4e2N/(logN)2 ,  c6J,  |c-m|  >  N7/12) 

>  1  -  D(logN)2N~1/6  (47) 

4„  cCJ,  j  c-m|  <  N1/3 

For  c  m,  we  have  by  the  first  part  of  (39) 

sc  -  Sm  I  "  2N1/30^.+c2)  1  -  2N1/3(^+^n1/3x2)  (48) 

From  (40)  we  have  p(  *2c\  <  j  a  |  +  |  e  |  +  1 ,  c  6  J)  >  1  -  D(logN)3/N, 
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m+N1/3  ,  !  /o  ,,c 

P(£  x?  1  N1/2)  1  1  -  Dff1/6 
m 

Hence,  from  (48)  we  get 

P(S2  -  S2  >  -  N6/7,  c6J,  m  <  c  <  m  +  N1/3)  >  1  -  DN"1/6 
'em-  —  —  — 

This  combining  with  the  similar  inequality  obtained  for  the  case  of  c  <  m, 
gives 

P(S2  "  Sm  -  '  n6/?’  c6J*  lc-ml  -  n1/^  -  1  “  DN"1/6 

Combine  this,  (33),  (34),  (44),  (47),  define  Q  as  before,  and  notice 
?  2  2 

that  SM  >  S  >  S  ,  we  obtain 
N  —  m  — 

2  2  -1/12 
P(|S^/N-a  |logN  >  Q)  <  D(logN)  N  (49) 

Similar  to  the  case  of  0  =  0,  here  we  can  still  prove  that  there  exists 

a  constant  R  not  depending  on  n,  such  that  lim  P(Q  >_  Re)  -  0.  Also, 

n-x» 

under  the  conditions  of  the  present  theorem,  we  can  find  S'  >0  such  that 
N  |o  -  Sm/N|  •+  0  in  probability.  These  facts,  combined  with  (49),  give 
(27),  and  the  theorem  is  proved. 


s’3 

M 


4.  BIAS  OF  THE  VARIANCE  ESTIMATOR 

2  2  2 

Maintain  the  notations  of  section  3.  Since  S  =  min{Sc>  <  S^,  and 

*2  m2  2 

ES^/(N-2)  *  c2,  we  see  that  even  if  we  replace  a  by  o  =  S  /(N-2), 

82  still  underestimates  a2.  Intuitively  it  seems  clear  that  the  bias 
should  be  more  serious  if  |e| /a  is  small..  So  we  propose  to  look  in  some 
detail  the  bias  in  case  that  6-0. 

From  (31)  we  have 

Vsn-s2>  3  i<c<n  dta'VR  V2 *  [-hv^i/to]2  A  % 

where  Tc  =  £^(X^-a)/a,  and 


ZN(t)  » 


(T[(N+l)t]  '  [<N*l)t]TN/N)//FT,  0  <  t  «  1 


t  =  1 


0 

When  is  normally  distributed,  or  more  generally,  when  E j X ^ |  ®  fo> 

some  <5  >2,  the  following  asymptotic  distribution  is  valid  (Yao  and 
David  (1984),  Csorgo  and  Horvath  (1986)): 

^  P(^log2n  /njj’  -  (21og2n+ilog3n-^logiO  <  x)  *  exp(-2e"x) 

Here  log^(x)  =  loglog^(x),  log^x  =  logx.  From  this,  we  deduce  the 
following  asymptotic  distribution  for  the  bias 


S2-S2 
11mn/  N  b 


P( — 2 —  <  21og2N+log3N-logiT+x)  =  exp(-2e"  '  ) 


mmmm 


This  asymptotic  result  suggests  that  the  bias  has  a  form 
S2  -  S2  =  2log2N  +  log3N  +  Qp(l) 

-x/2 

Further,  since  the  distribution  function  exp(-2e  )  has  an  expecta¬ 
tion  2y  +  21og2  =  2.5  +  (y  -  the  Euler  constant),  (50)  suggests  that 
when  N  is  large,  we  may  take 

S2/(N-1-(21  og2N+log3N-log7r)  -  2(y+log2))  =  S2/(N-21og2N-log3N-2  4) 

2 

as  the  estimator  of  a  ,  in  order  to  bring  down  the  bias. 

In  some  applications  we  might  know  in  advance  that  there  exist  some 
constants  A^,  x^,  0  <  A^  <  <  1,  such  that  the  change-point  tQ  lies  in 

the  interval  [a^,a2L  In  this  case  we  may  use 

S2(Aj,A2)  =  min(S2  :  A^N  <_  c  <_  AgN) 

2 

instead  of  S  .  We  have 

-^(S2-S2(a1,a2))  =  sup  |ZN(t) |//t(l-t)  i  vN 
a  A 

It  is  shown  (Csorgo  and  Horvath  (1986))  that  as  N  -  »>,  v^  converges  to 

v  =  sup(  |  V  ( s )  |  :  0<_S<^-|log(A2(l-A^)/A1(l-A2))),  where  (V(s)  :  S  >_  0  > 

is  the  Ornstein  -  Uhlenbeck  process,  i.e.,  a  Gaussian  process  with  mean 
zero  and  covariance  function  exp(-|t-s|).  The  distribution  function  of 
v  has  been  tabulated  by  DeLong  (1981).  Put  E(v)  =  C(Aj,a2),  it  is 

reasonable  to  take  S  (a^,a2)/(N-1-C(a^  a2))  as  an  estimator  of  a  . 

An  upper  bound  of  C(a^,a2)  can  be  estimated  in  the  following  way. 


^(s2-S2(xux2))  <  tnaxj^yd^  Tn)2  :  AN  <  c  <  (l-x)Nj 


r 


.  1  ;  HO  A  1  1  T  —  —  T  I 

-TO^iy  J_XN<c<(l-A)N^  c  "  ^  N  'n1 


■  ^N<C<_(1-X)n'  B(t)  1  X(l-A) 


L 


J 


Where  {B(t),  0  <  t  <  1}  is  the  Standard  Brownian  Bridge. 


The  distribution  function  of  q < t < 1 1 B ( t ) |  is  well-known  as 

L  I  1  0  0 

1  -  l°°  2(-l)  1exp(-2k^x^).  From  which  it  is  easy  to  get 


E|o™<i'B(t)l 


I 


1  — T  +  _V  — T  +  •  •  •**  ^0.85 
2^  3^  r 


Hence , 


C(X1  x2)  <  0.85  X"1(l-x)'1 


Therefore,  if  we  use 

a2(X1>X2)  =  S2(X1jA2)/(N-2) 

2 

to  estimate  a  ,  the  relative  bias  does  not  exceed 

[a2-Ea2(x1  X2)]/o2  =  (0„85  a'^I-a)'1  -  l)/(N-2)  (51) 

For  example,  take  x^  =  1  -  x2  =  0.1,  This  value  can  perhaps  be  considered 

as  small  eno-'gh  for  many  applications.  Take  N  =  100.  According  to  (51), 
the  relative  bias  does  not  exceed  8.6%,  which  is  reasonably  good,  con¬ 
sidering  the  relatively  small  sample  size  100  in  estimating  the  variance 
of  a  complicated  model. 


5.  F  IS  NON- NORMAL 


When  the  distribution  of  the  random  error  e(t)  is  non-normal ,  we  can 
use  the  theory  of  strong  approximation  of  partial  sums  of  iid.  variables 
by  Brownian  Motion  Process  to  give  extensions  of  Theorem  1  to  non-normal 
cases.  In  this  way  the  methods  of  previous  sections  can  still  be 
appl ied. 

THEOREM  4.  Replacing  the  assumption  of  Theorem  1  that 
Xj—  N(a,o2)  by 

E(exp(tX^))  <  °°  for  |t|  small  enough  (52) 

Then  the  conclusion  of  Theorem  1  remains  valid. 

Proof.  Put 

Sk  =  {X1  +  "*  +  xk  -  kaV°>  k  =  1*2,... 

According  to  Komlcs  and  others  (1975,  1976),  there  exists  a  Brownian 
Motion  process  (W(t),  t  >  0}  such  that 


1  im  sup  sup  | c 
n~  k<n  ISk-M(k)|/logn 


Since 


Va  =  ^Sm"Sm-P  “  (Sm-£'Sm-2£^//27 

We  have  for  m  <  n 

|  Ym/a-[W(m)-2W(m-e)+W(m-2£)]//2l|  <4  Sk-W(k  •  I//27 


(54) 


According  to  (53),  noticing  that  logn/vT-^  0  as  n  -*•  <*>,  we  get 
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•lit 


art 


•‘IVi 

■>!w 


^Ov/0-[W(m)-2W(m-f)  +  W(m-2£)]//2I|]  =  0,  a.s.  (55) 


From  Theorem  1 ,  we  have 


lim  P  {sup([W(m)-2W(m-£)  +  W(m-2£)]//2I  :  m  =  2£,.,,,n)} 


<_  An(x)  =  exp(-2e"x),  -  «  <  x  <  « 


Finally,  (7)  follows  from  (55)  and  (56).  Theorem  4  is  proved. 

Since  under  the  assumption  (52)  the  conclusion  of  Theorem  3  is  also 
true,  it  follows  that  the  method  of  the  previous  two  sections  applies 
to  the  case  in  which  (52)  is  true. 

The  condition  ensuring  the  asymptotic  results  in  sections  2,  3,  can 
further  be  substantially  weakened. 

2 

THEOREM  5.  Replacing  the  assumptions  of  Theorem  1  that  N(a,o  ) 

p 

and  (logn)  /l  ■+■  0  by  :  For  some  6  >  2 


E|Xj |  < 


lim  2/6/z  =  Q 
n-*» 


Then  the  conclusion  of  theorem  1  remains  valid. 

The  proof  parallels  that  of  Theorem  4,  with  the  help  of 
another  result  of  Kimlos  and  others,  which  asserts 


0,  a.s. 


under  condition  (57) 


6.  ESTIMATION  OF  JUMP  e 


To  form  a  point  estimate  of  the  jump  e  at  the  change  point  tg, 

we  suggest  the  following  procedure: 

1.  Find  k  such  that  |Y^|  =  5  , 


Compute 


6  =  en 


1  ?  X  1 

^  k+1  1  ' 


k-2 1 

\  xi 


0  is  taken  as  an  estimate  of  e. 


(59) 


When  k  =  21,  or  k  =  n,  e  is  not  defined.  In  general,  if  k  is 
too  near  21  or  n,  this  would  imply  that  the  change-point  tg  is  too 

near  0  or  1,  and  the  samples  at  our  disposal  are  perhaps  not  enough  to 
give  a  reasonable  estimate. 

For  an  interval  estimation  of  9,  we  prove  the  following  asymptotic 
theorem  about  e. 

THEOREM  6,  Suppose  that  the  5-th  order  moment  of  distribution  F 
is  finite  for  some  5  >  2.  Further,  e  +  0  and  l  satisfies  (58) 
and  1/ n  -*■  0.  Then  as  n  -*■  °°  we  have 

(/nt^T-t^)/o)(e-0)-^>  N(0,1)  (60) 

where  — L->  stands  for  convergence  in  law. 

Proof.  Without  losing  generality,  we  assume  a  =  0,  a  =  1  (see 
section  1).  Using  Theorem  5  and  slightly  modifying  the  argument  of  section  2 
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p(nt0  1  k  i  ntQ  +  21)  =  1 


(61) 


Define  =  [ntQ  -  2£] ,  =  n  -  [ntQ  +  2£]  -  1 ,  and 

nl 

T  =  /sonj  (~  l  (xi  -  •  tt:  \  XiS 

n  0  u  \n2  n-n9+l  1  1  > 


(62) 


Since  £/n  -  0,  0  <  tn  <  1,  we  have  as  n  -  • 


Tn— N(0,1 ) 


(63) 


Now, 


n-k-n0  n 


Tn  -  {;-6)  =  nJo+ilX1’e) 


n-n. 


k-2*-nj  _1 


2 

k-2£ 


ik  (xi-6) '  \h*KH  {  xi] 


Therefore, 


Sup{ 


{ | Tn  -  /nt0('i'-t0_)(9-e)!  :  ntQ  <  k  I  ntQ  +  2£} 


<  C  jn“3/z£.Sup(|J  (Xi-e) |  :  j  in  -  n2  +  1) 


n-n„ 


+  n'1/2Sup(|  l  (X.-e)|  :  [ntQ]  +  1  <  j  I  n  -  n2) 


+  n"3^  £.Sup(|^j  X.|  :  1  1  3  1  nl^ 

1  4 

n-1/2Sup(|Xj  +1Xil  :nj  +  l<j  1  [ntQ])j  4  .I^ni 


(64) 


where  C  is  a  constant  not  depending  on  n,  Since  £/n-*0, 
Xi*****Xn  are  independent  with  variance  1,  EX..  =  9  for 


i  >  n  -  +  1,  it  follows  from  Dosker's  Theorem  that 


Similarily, 


0, 


as  n 


(65 


Ip2  — — *  0,  as  n  oo  (55 

Now  define  X..  =  +  e  for  1  <_  i  £  [nt^],  X..  =  X..  for  [ntQ]  +  1 

i  <_  n.  Given  arbitrarily  e  >  0,  we  have  by  Donsker's  Theorem 

n-n2 

Ip2  <  n“1/2  Sup(|  l  (X.-e)  :  [n(tQ-e)]  <_  j  <_  n  -  n2) 

J 

»  Sup(  !w(t)|:tQ-e<_t<_tQ),  as  n 

- »  0,  a.s.  ,  as  e  -*•  0 


Hence, 


Simil arly 


Summing  up 


(65)  -  (68) ,  we  get 


as 


n  -*■  » 


n  -*•  » 


(67 

(68 


Sup(|Tn-  /nt0(i-t0)(9-9)|  :  ntQ  <  k  <_  ntQ  +  2 1)  0 

as  n  -*■  co.  From  this  and  (61),  (63),  we  obtain  (60).  The  proof  is 


concl uded. 
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It  is  easy  to  see  that  tg  =  ( k-£)/n  is  a  consistent  estimator  of 
tg  (of  course,  when  e  /  0  and  thus  tg  is  well-defined).  Earlier  we 

A  A 

have  introduced  a  consistent  estimate  a  of  a.  Substituting  tg 

A 

for  tg  and  o  for  a.  We  get  the  following  theorem: 

THEOREM  7.  Suppose  that  the  conditions  of  Theorem  6  are  met,  we 


(/ntg(l-tg)/a)(e-e)  N(0,I) 


as  n  -*■  °° 


When  0  =  0,  in  which  case  tg  has  no  meaning,  the  statistic  tQ 

is  still  well-defined.  It  is  not  known  whether  or  not  (69)  is  true  for 
0=0.  So  Theorem  7  cannot  be  employed  to  make  tests  for  the  hypothesis 
e=0,  but  (69)  can  be  used  to  form  a  confidence  interval  of  e,  when 
9/0  is  assumed  in  advance,  or  as  a  result  of  the  rejection  of  null 
hypothesis  0=0. 

Of  course,  if  is  normally  distributed,  or  more  generally, 
satisfies  condition  (52),  then  the  condition  (58)  in  Theorem  6  and  7  can 

p 

be  weakened  to  (logn)  fl  0, 
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